智能论文笔记

Learning Obstacle-Avoiding Lattice Paths using Swarm Heuristics: Exploring the Bijection to Ordered Trees

Victor Parque

分类：机器人 | 人工智能 | 神经与进化计算

2022-09-12

晶格路径是在离散/网格图中有效导航的功能实体。本文提出了一种新方案，以最大的效率生成无碰撞的晶格路径，该方案利用双界有序的树木对生根的树木产生了最大的效率，从而使一维搜索问题呈现。我们使用十个最先进和相关性质启发的群体启发式的计算研究在带有凸面和非凸线几何的障碍物的导航方案中显示出可行性和效率在呈现无碰撞的晶格路径方面。我们认为，我们的计划可能会发现在离散地图中计划和组合优化的快速算法中的用途。

translated by 谷歌翻译

Towards Hexapod Gait Adaptation using Enumerative Encoding of Gaits: Gradient-Free Heuristics

Victor Parque

分类：机器人 | 人工智能 | 神经与进化计算

2022-09-01

希望将多条机器人系统对变化条件的有效改编有望使新见解对机器人控制和运动产生新的见解。在本文中，我们研究了赫克萨波德步态的枚举（阶乘）编码的性能前沿，以便快速恢复到腿部失败的条件。我们使用五个自然启发的无梯度优化启发式方法的计算研究表明，有可能实现可行的恢复步态策略，从而实现最小的偏差，并通过一些评估（试验）来实现所需的运动指令。例如，可以生成可行的恢复步态策略，达到2.5厘米。（10厘米）平均相对于指挥方向进行40-60（20）/试验的偏差。我们的结果是有可能有效适应新条件，并进一步探索机器人运动问题适应的规范表示。

translated by 谷歌翻译

HTML版本

A Study on Broadcast Networks for Music Genre Classification

Ahmed Heakl , Abdelrahman Abdelgawad , Victor Parque

分类：人工智能

2022-08-25

由于对音乐流媒体/推荐服务的需求增加以及音乐信息检索框架的最新发展，音乐流派分类（MGC）引起了社区的关注。但是，已知基于卷积的方法缺乏有效编码和定位时间特征的能力。在本文中，我们研究了基于广播的神经网络，旨在提高一小部分参数（约180k）下的本地化和概括性，并研究了12个广播网络的变体，讨论了块配置，汇总方法，激活功能，归一化的效果机理，标签平滑，通道相互依赖性，LSTM块包含和成立方案的变体。我们使用相关数据集进行的计算实验，例如GTZAN，扩展宴会厅，Homburg和Free Music Archive（FMA），显示了音乐流派分类中最新的分类精度。我们的方法提供了洞察力，并有可能使音乐和音频分类启用紧凑且可推广的广播网络。

translated by 谷歌翻译

HTML版本

Choosing the Number of Topics in LDA Models -- A Monte Carlo Comparison of Selection Criteria

Victor Bystrov , Viktoriia Naboka , Anna Staszewska-Bystrova , Peter Winker

分类：自然语言处理 | 机器学习 | (统计)机器学习

2022-12-28

Selecting the number of topics in LDA models is considered to be a difficult task, for which alternative approaches have been proposed. The performance of the recently developed singular Bayesian information criterion (sBIC) is evaluated and compared to the performance of alternative model selection criteria. The sBIC is a generalization of the standard BIC that can be implemented to singular statistical models. The comparison is based on Monte Carlo simulations and carried out for several alternative settings, varying with respect to the number of topics, the number of documents and the size of documents in the corpora. Performance is measured using different criteria which take into account the correct number of topics, but also whether the relevant topics from the DGPs are identified. Practical recommendations for LDA model selection in applications are derived.

translated by 谷歌翻译

HAC-Net: A Hybrid Attention-Based Convolutional Neural Network for Highly Accurate Protein-Ligand Binding Affinity Prediction

Gregory W. Kyro , Rafael I. Brent , Victor S. Batista

分类：机器学习

2022-12-23

Applying deep learning concepts from image detection and graph theory has greatly advanced protein-ligand binding affinity prediction, a challenge with enormous ramifications for both drug discovery and protein engineering. We build upon these advances by designing a novel deep learning architecture consisting of a 3-dimensional convolutional neural network utilizing channel-wise attention and two graph convolutional networks utilizing attention-based aggregation of node features. HAC-Net (Hybrid Attention-Based Convolutional Neural Network) obtains state-of-the-art results on the PDBbind v.2016 core set, the most widely recognized benchmark in the field. We extensively assess the generalizability of our model using multiple train-test splits, each of which maximizes differences between either protein structures, protein sequences, or ligand extended-connectivity fingerprints. Furthermore, we perform 10-fold cross-validation with a similarity cutoff between SMILES strings of ligands in the training and test sets, and also evaluate the performance of HAC-Net on lower-quality data. We envision that this model can be extended to a broad range of supervised learning problems related to structure-based biomolecular property prediction. All of our software is available as open source at https://github.com/gregory-kyro/HAC-Net/.

translated by 谷歌翻译

VCNet: A self-explaining model for realistic counterfactual generation

Victor Guyomard , Françoise Fessant , Thomas Guyet , Tassadit Bouadi , Alexandre Termier

分类：人工智能 | 机器学习

2022-12-21

Counterfactual explanation is a common class of methods to make local explanations of machine learning decisions. For a given instance, these methods aim to find the smallest modification of feature values that changes the predicted decision made by a machine learning model. One of the challenges of counterfactual explanation is the efficient generation of realistic counterfactuals. To address this challenge, we propose VCNet-Variational Counter Net-a model architecture that combines a predictor and a counterfactual generator that are jointly trained, for regression or classification tasks. VCNet is able to both generate predictions, and to generate counterfactual explanations without having to solve another minimisation problem. Our contribution is the generation of counterfactuals that are close to the distribution of the predicted class. This is done by learning a variational autoencoder conditionally to the output of the predictor in a join-training fashion. We present an empirical evaluation on tabular datasets and across several interpretability metrics. The results are competitive with the state-of-the-art method.

translated by 谷歌翻译

When Not to Trust Language Models: Investigating Effectiveness and Limitations of Parametric and Non-Parametric Memories

Alex Mallen , Akari Asai , Victor Zhong , Rajarshi Das , Hannaneh Hajishirzi , Daniel Khashabi

分类：自然语言处理 | 人工智能 | 机器学习

2022-12-20

Despite their impressive performance on diverse tasks, large language models (LMs) still struggle with tasks requiring rich world knowledge, implying the limitations of relying solely on their parameters to encode a wealth of world knowledge. This paper aims to understand LMs' strengths and limitations in memorizing factual knowledge, by conducting large-scale knowledge probing experiments of 10 models and 4 augmentation methods on PopQA, our new open-domain QA dataset with 14k questions. We find that LMs struggle with less popular factual knowledge, and that scaling fails to appreciably improve memorization of factual knowledge in the tail. We then show that retrieval-augmented LMs largely outperform orders of magnitude larger LMs, while unassisted LMs remain competitive in questions about high-popularity entities. Based on those findings, we devise a simple, yet effective, method for powerful and efficient retrieval-augmented LMs, which retrieves non-parametric memories only when necessary. Experimental results show that this significantly improves models' performance while reducing the inference costs.

translated by 谷歌翻译

Efficient Conditionally Invariant Representation Learning

Roman Pogodin , Namrata Deka , Yazhe Li , Danica J. Sutherland , Victor Veitch , Arthur Gretton

分类：机器学习 | (统计)机器学习

2022-12-16

We introduce the Conditional Independence Regression CovariancE (CIRCE), a measure of conditional independence for multivariate continuous-valued variables. CIRCE applies as a regularizer in settings where we wish to learn neural features $\varphi(X)$ of data $X$ to estimate a target $Y$, while being conditionally independent of a distractor $Z$ given $Y$. Both $Z$ and $Y$ are assumed to be continuous-valued but relatively low dimensional, whereas $X$ and its features may be complex and high dimensional. Relevant settings include domain-invariant learning, fairness, and causal learning. The procedure requires just a single ridge regression from $Y$ to kernelized features of $Z$, which can be done in advance. It is then only necessary to enforce independence of $\varphi(X)$ from residuals of this regression, which is possible with attractive estimation properties and consistency guarantees. By contrast, earlier measures of conditional feature dependence require multiple regressions for each step of feature learning, resulting in more severe bias and variance, and greater computational cost. When sufficiently rich features are used, we establish that CIRCE is zero if and only if $\varphi(X) \perp \!\!\! \perp Z \mid Y$. In experiments, we show superior performance to previous methods on challenging benchmarks, including learning conditionally invariant image features.

translated by 谷歌翻译

Collision Avoidance Testing of the Waymo Automated Driving System

Kristofer D. Kusano , Kurt Beatty , Scott Schnelle , Francesca Favaro , Cam Crary , Trent Victor

分类：机器人

2022-12-15

This paper describes Waymo's Collision Avoidance Testing (CAT) methodology: a scenario-based testing method that evaluates the safety of the Waymo Driver Automated Driving Systems' (ADS) intended functionality in conflict situations initiated by other road users that require urgent evasive maneuvers. Because SAE Level 4 ADS are responsible for the dynamic driving task (DDT), when engaged, without immediate human intervention, evaluating a Level 4 ADS using scenario-based testing is difficult due to the potentially infinite number of operational scenarios in which hazardous situations may unfold. To that end, in this paper we first describe the safety test objectives for the CAT methodology, including the collision and serious injury metrics and the reference behavior model representing a non-impaired eyes on conflict human driver used to form an acceptance criterion. Afterward, we introduce the process for identifying potentially hazardous situations from a combination of human data, ADS testing data, and expert knowledge about the product design and associated Operational Design Domain (ODD). The test allocation and execution strategy is presented next, which exclusively utilize simulations constructed from sensor data collected on a test track, real-world driving, or from simulated sensor data. The paper concludes with the presentation of results from applying CAT to the fully autonomous ride-hailing service that Waymo operates in San Francisco, California and Phoenix, Arizona. The iterative nature of scenario identification, combined with over ten years of experience of on-road testing, results in a scenario database that converges to a representative set of responder role scenarios for a given ODD. Using Waymo's virtual test platform, which is calibrated to data collected as part of many years of ADS development, the CAT methodology provides a robust and scalable safety evaluation.

translated by 谷歌翻译

NoPe-NeRF: Optimising Neural Radiance Field with No Pose Prior

Wenjing Bian , Zirui Wang , Kejie Li , Jia-Wang Bian , Victor Adrian Prisacariu

分类：计算机视觉

2022-12-14

Training a Neural Radiance Field (NeRF) without pre-computed camera poses is challenging. Recent advances in this direction demonstrate the possibility of jointly optimising a NeRF and camera poses in forward-facing scenes. However, these methods still face difficulties during dramatic camera movement. We tackle this challenging problem by incorporating undistorted monocular depth priors. These priors are generated by correcting scale and shift parameters during training, with which we are then able to constrain the relative poses between consecutive frames. This constraint is achieved using our proposed novel loss functions. Experiments on real-world indoor and outdoor scenes show that our method can handle challenging camera trajectories and outperforms existing methods in terms of novel view rendering quality and pose estimation accuracy.

translated by 谷歌翻译